Representing and Querying Uncertain Data
نویسنده
چکیده
There has been a longstanding interest in building systems that can handle uncertain data. Traditional database systems inherently assume exact data and harbour fundamental limitations when it comes to handling uncertain data. In this dissertation, we present a probabilistic database model that can compactly represent uncertainty models in full generality. Our representation is associated with precise and intuitive semantics and we show that the answer to every user-submitted query can be obtained by performing probabilistic inference. To query large-scale probabilistic databases, we propose a number of techniques that help scale probabilistic inference. Foremost among these techniques is a novel lifted inference algorithm that determines and exploits symmetries in the uncertainty model to speed up query evaluation. For cases when the uncertainty model stored in the database does not contain symmetries, we propose a number of techniques that perform approximate lifted inference. Our techniques for approximate lifted inference have the added advantage of allowing the user to control the degree of approximation through a handful of tunable parameters. Besides scaling probabilistic inference, we also develop techniques that alter the structure of inference required to evaluate a query. More specifically, we show that for a restricted model of our probabilistic database, if each result tuple can be represented by a boolean formula with special characteristics, i.e., it is a read-once function, then the complexity of inference can be drastically reduced. We conclude the dissertation with a listing of directions for future work. To Mummy, Baba and Shiny, Dadu and Diddey, Amma and Dada, for always supporting me
منابع مشابه
Title of dissertation : REPRESENTING AND QUERYING UNCERTAIN DATA
Title of dissertation: REPRESENTING AND QUERYING UNCERTAIN DATA Prithviraj Sen, Doctor of Philosophy, 2009 Dissertation directed by: Professor Lise Getoor Department of Computer Science Professor Amol Deshpande Department of Computer Science There has been a longstanding interest in building systems that can handle uncertain data. Traditional database systems inherently assume exact data and ha...
متن کاملModeling, Querying, and Mining Uncertain XML Data
This chapter deals with data mining in uncertain XML data models, this uncertainty typically coming from imprecise automatic processes. We first review the literature on modeling uncertain data, starting with well-studied relational models and moving then to their semistructured counterparts. We focus on a specific probabilistic XML model, that allows representing arbitrary finite distributions...
متن کاملSimilarity Search and Mining in Uncertain Databases
Managing, searching and mining uncertain data has achieved much attention in the database community recently due to new sensor technologies and new ways of collecting data. There is a number of challenges in terms of collecting, modelling, representing, querying, indexing and mining uncertain data. In its scope, the diversity of approaches addressing these topics is very high because the underl...
متن کاملQuerying the Uncertain Position of Moving Objects
In this paper we propose a data model for representing moving objects with uncertain positions in database systems. It is called the Moving Objects Spatio-Temporal (MOST) data model. We also propose Future Temporal Logic (FTL) as the query language for the MOST model, and devise an algorithm for processing FTL queries in MOST.
متن کاملQuerying Uncertain Data in Resource Constrained Settings
Querying Uncertain Data in Resource Constrained Settings
متن کاملExtending dynamic queries to handle uncertain data
Dynamic querying is a technique which has been used successfully to enable novice users to gain access to and insight into data in databases. Some multimedia archives (such as archives of African art) contain data which have vague locations in time and space, that is, although there is some idea of when and where the entity originated, the precise information is unknown. This uncertainty create...
متن کامل